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(54) Title: APPROACHES FOR HPV DETECTION AND STAGING BY TARGETING THE E6 GENE REGION OF THE VIRAL 
GENOME 

(57) Abstract: The Ll/El gene region of the HPV virus may be deleted during integration into the genome of the host cell, but the 
E6/E7 gene region is always retained. There is a need to detect HPV infection and cervical cancer in way that provides information 
about the stage of infection so that the proper treatment can be undertaken. 
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APPROACHES FOR HP V DETECTION AND STAGING BY 
TARGETING THE E6 GENE REGION OF THE VIRAL GENOME 
FIELD OF THE INVENTION 
Integration of the E6 region of human papillomavirus into the cellular 
5 DNA is an important step in the progression to malignancy. This invention involves 
methods for detecting human papillomavirus in cervical cells and determining the 
progression of the infection. 

BACKGROUND OF THE INVENTION 
Infection by HPV involves the passage of the viral DNA into a cell. The 
10 HPV viral genome can be divided into 3 regions, upstream regulatory region (URR) or 
long control region (LCR), the early gene region and the late gene region. These 
regions control sequences for HPV replication and gene expression, encoding the E2, 
E6 and E7 genes, and encoding the LI and L2 genes respectively (Turek, Adv. Virus 
Res. 44:305-356 (1994)). Initially at least the circular HPV DNA remains free inside 
15 the cell in an episomal form. Whereas the episomal form predominates early in 
infection, this situation may change later, with the subsequent occurrence of 
integration. Although initially some episomal HPV DNA remains along with the 
integrated HPV DNA in an infected cell, ultimately, in a significant proportion of 
cancers the integrated form not only dominates, but represents the only HPV DNA 
20 present. The evidence suggests that integration may be an important step in the 
progression to malignancy. 

The viral genomes are exclusively maintained as episomes in benign 
lesions induced by HPV types such as 6 and 1 1 (Dowhanick et al, Suppression of 
cellular proliferation by the papillomavirus E2 protein. J. Virol 1995; 69:7791-7799; 
25 Kobayashi et al., Presence of human papillomavirus DNA in pelvic lymph nodes can 
predict unexpected recurrence of cervical cancer in patients with histologically 
negative lymph nodes. Clin. Cancer Res 1998; 4:979-83). Only episomal HPV is 
detected in CIN I and integrated sequences are rarely found in CIN II and CIN III 
(Cullen et al. Analysis of the physical state of different human papillomavirus DNAs 
30 in intraepithelial and invasive cervical neoplasms. J. Virol, 1991; 65:606-612; Das et 
al. "Analysis by polymerase chain reaction of the physical state of human 
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papillomavirus type 16 DNA in cervical preneoplastic and neoplastic lesions. J. Gen. 
Virol. 1992; 73:2327-2336). In contrast, the viral DNA is usually integrated into the 
cellular genome in cell lines derived from cervical carcinomas (Boshart et al., "A New 
Type of papillomavirus DNA, its presence in genital cancer biopsies and in cell lines 
5 derived from cervical cancer. EMBO J. 1984 3: 1 151-1 157; Howley, P.M. Presence 
and expression of human papillomavirus sequences in human cervical carcinoma cell 
lines. Am J. Pathol 1985 1 19:361-366; and Tsunokawa et al., "Presence of human 
papillomavirus type-16 and type-18 DNA sequences and their expression in cervical 
cancers and cell lines from Japanese patients. Int. J. Cancer 1986:37:499-503; and Yee 

10 et al."Presence and expression of human papillomavirus sequences in human cervical 
carcinoma cell lines. Am. J. Pathol. 1985: 119:361-6) . This suggests that integration 
begins early in cancer development and is an important event in malignant 
transformation (Bosch et al. "Prevalence of human papillomavirus in cervical cancer: a 
worldwide perspective . J. National Cancer Inst. 1995; 87:796-802; Cullen, et al. 

15 Analysis of the physical state of different human papillomavirus DNAs in 

intraepithelial and invasive cervical neoplasms. J. Virol 1991; 65:606-612; and Vem<___ 
et al. "Association of human papillomavirus type 16 integration in the E2 gene with 
poor disease-free survival from cervical cancer. Int. J. Cancer 1997: 74:50-56). Thus 
the accurate detection of the LI versus E6 status of the HPV DNA may be very 

20 important in determining cervical cancer progression and in assisting in clinical 
management of those women. 

Integration was first described by the Heidelberg group of zur Hausen 
and Gissman (Durst et al.J. Gen. Virol. 1985, 66 (Pt 7): 1515-22; Schwarz et al. Nature 
1985; 314:1 1 1-4). For HPV 16, cancer cells displayed both integrated and episomal 

25 forms, with the integration occurring as head-to-tail arrays (Durst et al. J. Gen. Virol. 
1985, 66 (Pt 7) 1515-22). In the case of HPV 18, Schwarz et al. Nature 1985,314:111- 
4) found that in 3 cell lines, the HPV DNA was integrated and amplified. A mixture of 
integrated and episomal HPV16 in carcinomas was also found by Fukushima et al. 
Cancer 1990, 66:2155-61). Yiu et al. Oncogene 1991; 6:1339-42, noted that of 9/15 

30 cervical carcinomas with HPV 16 DNA, 44% had integrated, 44% had episomal and 
1 1% both. Matsukura et al. Virology 1989: 172:63-72 (1989) found both integrated 
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and episomal HPV16 in 34 invasive cervical cancers. Eight of these exhibited only the 
integrated form of HPV DNA. 

In a study of cervical cancers by Park et al. Gynecol. Oncol. 1997; 
9:267-76, HPV16 orl8 DNA was found to predominate, being present in 75% of 68 
' 5 cases, with 7% having episomal only, and 18% a mixture of each. It was apparent to 
these workers that a difference existed between HPV 16 and HPV 1 8 in that, of the 51 
with HPV 16, 71% had only integrated HPV DNA, 20% had both integrated and 
episomal and 10% episomal only. In contrast all 17 HPV 18-containing cancers 
reveak^f'only integrated HPV DNA. This is a significant observation, given claims of 
10 the aggressive, rapidly progressing nature of HPV 18-associated dysplasias, and is 
highly relevant to screening tests. 

An overview of the various published studies by Pfister & Fuchs, 
Dermatol. Clin. 1991; 9:267-76 concluded that 36-71% of cervical cancers had just 
episomal HPV DNA, cancers in which the HPV DNA present was completely in the 
15 integrated form were 22-39% of the total, and those with both episomal and integrated 
HPV DNA comprised 6-25%. 

Early observations for dysplasias with HPV 16 infection showed 
evidence of integration in 86%, i.e, integration occurs in the precancerous stage of 
HPVs clinical course (Shirasawa et al. J. Gen. Virol 1986, 67:2011-5). 
20 Fukushima et al. Cancer 1990; 66:2155-61 looked at CIN samples as 

well and found that of 7 positive for HPV 16, 3 contained only integrated HPV16 
DNA. 

In a study of different stages of the carcinogenic process by Cullen et al. 
J. Virol. 1991; 65:606-612 it is stated that integration is a characteristic of malignant 
25 lesions. Of 100 CIN biopsy specimens they found 3% with integrated HPV. However, 
for 69 carcinomas, 81% displayed integration. The most common HPV was type 16 
(40/69 = 58%). Of these, 72% contained integrated HPV DNA and for 27% the 
HPV16 DNA was exclusively episomal. For the specimens that had integrated HPV 
DNA, in 80% of cases this was the only form in which the viral DNA was present in 
30 the cell. Only 20% had episomal as well as integrated HPV DNA. 

Cullen et al J. Virol. 1991; 65:606-612 also found in the case of 
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HPV18, which has greater transforming efficiency that all 23 carcinomas had 
integrated HPV DNA, with only 1 having episomal DNA as well. 

In the early work by Schwarz et al. Nature 1985;324:1 1 1-4 deletion of 
up to 2-3 kb of DNA was found in cell lines containing HPV DNA. This included the 
5 E2 to L2 region. Importantly, E6-E7 transcripts could be detected in these cells. 

Subsequent work by Shirasawa et al. J. Gen. Virol., 68:583-91, 1987) 
on cervical carcinoma cell lines found that open reading frames (ORFs) El, E2, E4 and 
E5 were interrupted by flanking host cell DNA suggesting that the integration into host 
cell DNA was occurring preferentially in these regions of the HPV genome. They 

1 0 found that HPV mRNAs hybridized with the entire E6 and E7 ORF and a minor part of 
the El ORF, meaning that these were the only portions of the HPV genome present. 
No hybridization to LI and L2 ORFs could be detected, implying that these regions of 
the virus had been deleted. 

In a study of 6 cervical carcinoma samples Choo et al. (Virol. 1987: 

15 161:259-61 confirmed that on integration, the E6/E7 region is retained, but found that 
the E2 region is lost. 

Matsukura et al J. Virol. 198658:979-82 found most of El and all of E2 
to be deleted in a cervical carcinoma containing HP VI 6 DNA. Interestingly, in one 
study of a cell line, the integration that disrupted E2 and L2 was found to have 

20 occurred in the premalignant lesion from which the line was derived 

(Schneider-Maunoury et al. 1 987, J. Virol : 61 :3295-8). Wagatsuma et al. J. Virol 
(1990) 64:813-21) found 4 clones of integrated HPV16 DNA that had deletions in the 
E1/E2 and the L1/L2 regions, and they state that no site specific for integration is 
present in the viral sequence. Rather, during integration, viral sequences are opened 

25 within any ORFs except the E6/E7 ORFs and locus control region. Jeon et al. J. Virol. 
1995; 69:2989-97 showed that integration of the viral genome into the human 
chromosome in the cancer cells usually disrupts or deletes the E2 ORF, which results 
in the loss of expression of the E2 gene. This was associated with the expression of 
high levels of E6 and E7 being maintained (Jeon et al., J. Virol 1995; 69:2989-97). 

30 Mutations of HPV DNA take place in the LI region of the HPV 

genome (but never the E6 region). In a detailed analysis of DNA from an 
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HPV16-positive cervical carcinoma only 3091 bp of the original 7905 bp viral genome 
remained (Cone et al. J. Med. Viroll992 37:99-107). This included the E6/E7 region. 
Moreover, whereas the E6 and E7 ORFs showed complete concordance with the 
published sequences (which also supports the role of E6/E7 in tumorigenesis), there 
5 were multiple mutations (transversions, transitions, small deletions, and small 

insertions) in the remaining integrated HPV 16 DNA, which was composed of parts of 
the LI and El ORFs. (The 5 1 end of LI was missing, the integrated DNA beginning at 
nt 6334). The mutations included: a single base deletion at nt 6387; insertion of 
nucleotides CAT at position 6901 (which has also been noted by others, viz. Baker et 

10 al. J. Viroll987 61:692-71; Choo et al., J. Virol 1988 62:1659-66; Matsukura et al. J. 
Virol. 1986; 58:979-82); and deletion of GAT at nt 6949. 

Integration of HP VI 6 DNA has in fact been found to lead to increased 
steady state mRNA encoding the viral oncogenes E6 and E7 as a consequence of 
increased stability conferred by disruption of A+U rich sequence in 3'-UTR of E6 and 

15 E7 mRNAs and replacement with cellular sequences with lower A + U content (Jeon 
& Lambert, L Virol. 1995; 69:2989-97). This would account at least in part for the 
higher concentration of E6 and E7 proteins in clonal populations with integrated 
HPV 16 DNA compared with ones in which the DNA is present exclusively in an 
episomal form (Jeon et al. J. Virol.1995; 69:2989-97). The cells with integrated DNA 

20 therefore outgrow those with episomal HPV DNA only, i.e. integration provides a 
growth advantage (Jeon et al. J. Virol. 1995; 69:2989-97). 

It is widely known that the E6 product binds p53, an important negative 
regulator of the cell cycle, and in so doing inhibits its activity, so leading to 
uncontrolled growth. This offers a biological basis for oncogenesis. Similarly the E7 

25 protein attaches to another crucial cell cycle regulator, the retinoblastoma binding 

protein. It is also known that the E2 gene encodes a site-specific DNA-binding protein 
that is involved in the regulation of the HPV promoter that directs E6 and E7 
expression (Romanczuk et al. J. Virol, 1990; 64(6):5240-9; Thieny & Howley, 1991 
New Biol; 3:90-100; Bernard ct al., J. Virol.1989; 63:4317-4324). 

30 As far as cervical cancer is concerned what matters is the E6 and E7 

region. The rest of the genome may have had a role in transmission of the virus to a 
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new host cell, but it is the proteins encoded by E6 and E7 that cause the cancer. 
Integration of HPV DNA occurs early in cancer development and is important as an 
activation mechanism for progression from precancer lesions to cancer lesions (Bosch 
et al. J. Natl. Cancer Inst. 1995; 87:796-802; Cullen et al. J. Virol 1991 65:606-612; 
5 Vemon et al. Int. J. Cancer 1997:74-50-56). 

Targeting the available E6 region rather than the deleted LI or L2 
regions would appear to be the best way to ensure that the critical DNA in cancer 
causation is not missed. This is consistent with the fact that the LI and L2 regions of 
the HPV genome can be deleted during integration into the genome of the host cell, but 

10 the E6/E7 region is always retained. Approaches targeting the LI and L2 regions 
therefore would appear to be inferior. Since deletion or mutation can involve 
specimens at an advanced stage of abnormality, the resulting false negative result 
could have fatal consequences. In contrast, primers that detect high risk types of HPV 
by the targeting the E6 region should not miss one of these high risk HPV infections. It 

15 is just a matter of ensuring that the relevant primers are included in the E6 PCR test 
that is performed. 

A popular approach for detecting HPV in a cell involves using primers 
directed at the LI region of the virus. See US Patents 5,182,377; 5,283,1 71 and 
5,447,839. However the approach of directing primers at the v Ll region has serious 

20 disadvantages and directing the primers at the E6 region is preferable in the clinical 
setting. Integration can occur in the LI region but not the E6 region. 

Another strategy involves PCR targeted to the cancer-causing E6 part of 
the virus (Morris & Nightingale 1987; Morris et al., 1988, 1990; and Dallas et al., 
1989) because this region shows the greatest differences in nucleotide sequence 

25 between different HPV types. This makes it possible to design type- or clade-specific 
primers in order to not only determine if HPV is present, but at the same time 
distinguish high- from low-risk HPV types. HPV groupings based on the sequence 
homology between the E6 regions of HPVs correlate with clinical significance 
(Lorinez et al. 1992), leading to support for risk-group specific primers. The value of 

30 this approach has been confirmed (Fujinaga et al. 199 1) and offers a direct means of 
getting the kind of information that is needed for clinical decision making. It is likely 
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that all of the major HPV types are now known, so that suitable primers directed at the 
E6 region of the various types, and capable of telling whether a woman is infected with 
a high-risk HPV, can be synthesized and readily incorporated into the same reaction 
tube. This makes the chance of missing an infection very unlikely indeed. Costs are 
5 also lower, since fewer steps are needed. This makes E6 testing more attractive in 
widespread screening. 

This approach formed the basis for the original PCR for HPV detection 
(Morris & Nightingale 1987; Morris et al 1988, 1990; Dallas et al. 1989). The idea of 
targeting the PCR primers to the E6 region was based on a number of important facts, 

10 not the least of which was the well-recognized oncogenic role of E6 in cancer. Primers 
were designed against sequences within the E6 region that were conserved between 
different high risk types. Other common primers were designed that would only 
hybridize within the E6 region of low-risk types. Thus there were primers that would 
hybridize to the most common low-risk HPV types (HPV6 and HPV 1 1) and others 

15 that would hybridize to the most common high-risk types (HPV 16 and HPV 18). By 
ensuring that PCR products of different sizes would emanate from this choice the test 
could discriminate high- and low risk HPV groups. For low-risk types (HPV6 and 
HPV1 1) the size of the band seen on electrophoresis was ~ 120 bp. For HPV16 (and 
HPV33, which could be detected using the same primers) it was -200 bp and for 

20 HPV18 it was - 100 bp. Confirmation of type was made by type-specific primers, 

although these were not necessary for routine screening. For the residue of rarer HPV 
types not covered by the test developed originally, other primers could be readily 
designed using the same principal, and added into the mixture of primers in the PCR. 

As stated, there is evidence that support the concept that integration of 

25 the HPV viral genome into the human chromosome may delete or disrupt the LI and 
other gene regions, whilst maintaining high levels of E6 and E7 expression (Armstrong 
& Holman, 1981; Figge et al. 1970). This deletion or disruption results in the loss of 
expression of the LI gene region, causing an absence of the LI gene region sequences 
following integration. Thus, though the targeting of the El or LI region has the 

30 potential to detect many, perhaps all, HPV types detection will not occur if the virus is 
present in an exclusively integrated form. It is known that the target for PCR will not 
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always be retained if it is in LI and other regions besides E6 and E7. On the other hand 
targeting the E6 or E7 regions will detect all HPV-positive patients, provided that 
primers for all relevant high-risk HPV types are used. That the method of detection 
strongly influences the rate of HPV detection was emphasized by Noffsinger et al. 
5 (1995). In a study of anal carcinomas they showed that the least sensitive method for 
HPV detection is PCR using LI consensus primers (rate=16%). On the other hand, 
type-specific primers directed at the E6 region yield a positiviity of 46% in anal 
carcinomas. 

LI PCR probably detects the majority of cases with HPV types 6 or 1 1, 

10 but underestimates HPV 16 infections, A suggestion that this is because of 

fragmentation of template DNA in formalin-fixed tissues (Park et al., 1991) is 
unlikely, as others have consistently been able to amplify HPV DNA from such 
specimens (Noffsinger et al. 1995). Even though LI PCR products can be 450 bp, 
type-specific primers give 380-440 bp PCR products from formalin-fixed speciments. 

15 The more likely explanation was said to be loss of HPV genomes upon integration, in 
particular deletion of LI and L2 (Noffsinger et al. 1995). Transcripts from these 
regions are indeed lost in HPV 16 infected cells (Stoler et al, 1992). The loss of late 
region genes is further supported by the rarity of positivity for viral capsid proteins in 
intra-epithelial and invasive anogenital neoplasia (Schwarz et al., 1985; Durst et al., 

20 1989). Thus PCR strategies that rely on the presence of late genes may significantly 
underestimate the number of cases that are HPV positive. In contrast, the E6 and E7 
genes are highly conserved (Cone et al, 1992; Stoler e al., 1992, Baker et al., 1987; 
Wagatsuma et al., 1990) and Noffsinger et al., (1995) state that HPV infection should 
therefore be detectable in almost all cases where viral DNA is present. 

25 It is known in the art to target so-called "consensus" primers to the 

highly conserved LI region of the HPV genome. An extra step is required such as 
probing with radioactive- or biotin-labelled type-specific oligonucleotides in order to 
address the important question of type of HPV present. 

Consensus primer sets include those directed at various parts of the LI 

30 region, viz. ones that have been termed by those who developed them My09-Myl 1 , 
Gp5-Gp6, Gp5+-Gp6+, and oli-lb-oli-2I, and those directed at the El region, viz- 
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CpI-CpIIG (see Karlsen et al. 1996 for an overview). The MY09/MY07 (MY-PCR) 
primer set (Manos et al. 1989) and the GP5+/GP6+ (GP+-PCR) primer set (de Roda 
Husman et al. 1995) are currently the most commonly used primer sets for HPV 
detection in clinical samples. The latter are a modification (extension in length) of an 
5 earlier version, GP5/GP6, which were designed to amplify the nt 6624-6765 region of 
the HPV genome to yield a 140-1 50 bp product (Snijders et al. 1990). The MY-PCR 
set amplify the nt 6582-7033 region to yield a 450 bp product (Manos et al. 1989). 
The MY-PCR is used more in America and Asia, and the GP+-PCR in Europe, 
reflecting the geographical locality where each set was developed. The MY-PCR 

10 primer set is synthesized with several degenerate nucleotides in each primer and is thus 
a mixture of 25 primers capable of amplifying a wide spectrum of HPV types (Manos 
et al. 1989; Hildesheim et al. 1994). In contrast, there are only two primers in the 
GP+PCR set and detection of a broad range of HPVs is achieved by using a lowered 
annealing temperature during PCR (de Roda Husman et al. 1 995). For El , two 21 

15 mers have been described that amplify within this conserved region of HPVs tested 
(Gregoire et al., 1999). 

In a comprehensive study that addresses this issue, Karlsen et al. (1996) 
used a range of consensus primer sets, including those directed at various parts of the 
LI region, viz. My09-Myll, Gp5-Gp6, Gp5+-Gp6+ and oli-lb-oli-2I and those 

20 directed at the El region, viz CpI-CpIIG. By testing with all of these primer sets, as 
well as primers directed at the E6 or E7 region, 98% of 355 biopsy specimens from 
patients with invasive cervical carcinomas were found to be positive for HPV. 
However, use of just one gave a much lower rate of detection. It is interesting to note 
that type-specific primers (for HPV1 1, 16, 18, 31, 33 and 35) detected more HPV- 

25 infected patients than the most sensitive consensus primer set. In fact it was necessary 
to use several consensus primer sets together (viz. The My/Gp/Gp+ and Cp sets) in 
order to detect a high number of HPV-positive patients. Moreover, based on results 
using consensus primers, LI deletions were present in 23 of 56 (41%) samples. The 
data argued strongly against the reliability of using LI consensus primers alone. 

30 Deletions mean not all consensus primers will have hybridization targets in the HPV 
DNA leading to the conclusion that a combination of consensus primers must be 
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included in any PCR test to have any hope of detecting all or more HPV present in a 
specimen (Karlsen et al. 1996). There appears to be agreement that use of either the 
commonly used MY -PCR or GP+-PCR methods alone will underestimate the true 
prevalence of HPV in cervical samples (Smits et al. 1995; Karlsen et al. 1996). 
5 In other studies, involving 635 cervical cancer samples from Spain, 

Colombia, and Brazil (i.e. 436 in a report by Munoz et al., 1992 and 199 cases in a 
study by Eluf-Neto et al., 1994), HPV was detected in just 85% of samples using only 
one consensus primer set. Even lower rate of detection emerged from a study involving 
the MY LI primers, which gave only 69% positivity (Guerrero et al. 1992). 

10 Since integration is an important step in the progression to malignancy, 

the spectre of missing HPV during testing by PCR if the primers are directed at a 
mutated or variable region, even if this region is not deleted. That is, one has to worry 
about both deletion and mutation if targeting the LI region using a consensus PCR. 
This is least likely to happen for the E6 or E7 region, since any mutation could have 

15 functional implications, most likely deleterious to viral oncogenic function so that the 
ramifications of failure to detect are likely to be inconsequential. In reality, there is 
"extraordinary conservation of the E6/E7 DNA sequence" (Cone et al., 1992) meaning 
that primers directed at the E6 region have a very much greater likelihood of annealing 
than ones targeting the LI region. The implication of these findings is that whereas 

20 screening in women with C1N will probably pick up more HPV infections when LI 
primers are used for PCR, some women with more advanced lesions may be missed 
because of the loss of LI during integration. And it is these women in particular who 
must be correctly diagnosed, since their need for treatment is greater and has to be 
instituted as a matter of priority. 

25 Approaches targeting the LI are therefore inferior. Since deletion or 

mutation can involve specimens at an advanced stage of abnormality, the resulting 
false negative result could have fatal consequences. In contrast, methods that detect 
high vs low risk groups of HPV by targeting the E6 region should never miss an 
infection. 

30 Thus targeting the available E6 region rather than the deleted LI region 

is the best way to ensure that the critical DNA in cancer causation is not missed. 
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Targeting the E6 or E7 region will detect all HPV-positive patients provided that all 
relevant high-risk HPV types are screened for. 

WO 99/29890 discloses methods for monitoring the stages of HPV- 
induced diseases by measuring the expression levels of mRNA from the E6, E7, E2, 
5 El regions, and determining the ratio of the expression level of E6 and/or E7 to LI 
and/or E2. 

The presence or absence of a particular form of HPV DNA may be 
indicative of the cervical cell state. That is there is likely to exist a continuum of HPV 
DNA forms that exist in parallel with cervical cell disease. Specifically, an episomal 
10 HPV infection may be considered representative of early stage HPV infection of CIN I 
with integrated HPV infection being considered representative of cancer. The 
determination of the presence or absence between the two ends of this spectrum may 
provide an indication of the stage of HPV-based cervical disease. 

Therefore, it is an object of this invention to detect the presence or 
15 absence of the E6, E7 or E6/E7 regions of HPV and the presence or absence of the LI, 
L2 or L1/L2 region of HPV in a sample of cervical cells. 

It is a further object of this invention to use these findings to assess the 
risk of developing HPV-based disease and/or to determine the stage of infection. 

SUMMARY OF THE INVENTION 
20 As the E6/E7 gene regions are retained and the L1/L2 regions are 

deleted in the process of disease progression, measuring of the relative amounts of 
DNA, RNA or expression of the E6 and/or E7 regions and LI and/or L2 regions of 
HPV infected cells provide: 

1. A method for assessing the stage of HPV-based disease; 
25 2. A method for assessing the risk that a woman with HPV infection 

will develop HPV-based disease; and 

3. A method for categorizing/staging women with HPV infection but 
without detectable HPV-based disease into those at risk for progression to disease and 
those not at risk of progression to disease. 
30 DETAILED DESCRIPTION OF THE INVENTION 

There are methods known in the art for detecting DNA and RNA. A 
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number of methods are particularly suited for detecting viral DNA or RNA. Any 
method that can be used to detect the presence or absence of the E6 and/or E7 and LI 
and/or L2 regions of the HPV genome can be used in the method of this invention. 

Methods for detection of HPV DNA or RNA include but are not 
5 limited to polymerase chain reaction (PCR) (See US Patents 4,683,195; 4,683,202; 
4,800,159; 4,965,188; 5,008,1825,176,995; 5,182,377; 5,283,171, and WO 88/06634); 
light cycla, Taq Man, Q-beta replicase, ligase chain reaction, PCR-Elisa and NASBA 
DNA detection. 

PCR can be used with ELISA (enzyme-linked immunosorbent assay) to 

1 0 identify DNA sequences in the E6/E7 and L1/L2 regions of HPV. 

The ligase chain reaction is a DNA amplification technique which is a 
cyclic two-step reaction: 1) a high-temperature melting step in which double-stranded 
target DNA unwinds to become single-stranded and 2) a cooling step in which two sets 
of adjacent, complementary oligonucleotides anneal to the single-stranded target 

15 molecules and ligate together. The products of the ligation from one cycle serve as 
templates for the next cycle's ligation reaction. Amplification is achieved in a similar 
manner to PCR. See Weiss, (1988) Science 254, 1292-1293; Landegren, U. et al. 
(1988) Science 241:1077-1080; Barany, F. (1991) PCR Methods and Applications 1:5- 
16 and Marsh, E., et al. (1992) Strategies 5:73-76. 

20 Q-beta replicase is an isothermal nucleic acid amplification system 

which uses the enzyme Q-beta replicase. See U.S. Patents 5,556,751 and 6,004,747. 

Taq Man involves the utilization of an additional oligonucleotide in the 
standard PCR reaction. The additional oligonucleotide hybridizes to a region between 
the forward and reverse oligonucleotides. The additional oligonucleotide is conjugated 

25 to a fluorescence molecule on the 5* end and a quenching molecule on the 3' end. 

When these two molecules are in proximity the fluorescence is quenched. When the 
Taq polymerase encounters this oligonucleotide it chews it off releasing the fluor and 
the quencher causing a change in fluorescence: The oligonucleotide probe that is 
specific for the target to be amplified is labelled with a fluorescent tag and a quenching 

30 molecule. During the extension step of PCR the Taq enzyme will disrupt probe bound 
to the target separating the fluorescent tag from its quencher molecule thus permitting 
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fluorescence. In another approach an oligonucleotide probe containing a reporter . 

molecule-quencher molecule pair that specifically "anneals to a region of a target 

polynucleotide downstream i.e. in the direction of extension of primer binding sites. 

The reporter molecule and quencher molecule are positioned on the probe sufficiently 
5 close to each other such that whenever the reporter molecule is excited, the energy of 

the excited state nonradiatively transfers to the quencher molecule where it either 

dissipates nonradiatively or is emitted at a different emission frequency See US 

Patents 5,210,015 and 6,030,787. 

NASBA (Nucleic acid sequence based amplification ) is an isothermal 
10 RNA amplification method using reverse transcriptase. NASBA is continuous rather 

than cyclic which means it measures all at once instead of waiting for a series of copies 

to be made. Quantitative detection is achieved by way of internal calibrators which are 

added at isolation and are co-amplified and subsequently identified along with the wild 

type of RNA using electrochemiluminscence. 
15 These methods and others known in the art can be used to detect DNA 

and/or RNA and/or expression of genes of HPV. 

The DNA sequences of strains of HPV are known. Known HPV include 

HPV la, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 

25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 47, 48, 
20 49, 50, 51, 52, 53, 54, 55, 56 , 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 72, 

73, 74, 75, 76, and 77. The DNA sequences of these viruses can be found in the 

GenBank database. 

The sequences of HPV 6, 1 1, 16, 18, and 33, are also described in the 

Following references. 

25 The sequence of HPV6 is given in Schwartz et a!., EMBO L 2:2361-8, 

1983. 

The sequence of HPV11 is given in Dartman et al. Virology 151:124- 

30, 1986. 

The sequence of HPV16 is given in Seedorf et al. Virology 145:181-5, 

30 1985. 

The sequence of HPV 18 is given in Matlashewski et al., J. Gen. Virol. 
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67:1909-16,1986. 

The sequence of HPV33 is given in Cole and Streeck. J. Virol 58:991-5, 

1986. 

The methods described above can be used to detect the presence or 
5 absence of the E6, E7 or E6/E7 region and the presence or absence of the LI, L2 or 
L1/L2 region. The presence or absence can be detected by using a method that detects 
or measures DNA, RNA or expression of the gene. The methods can also be used to 
determine whether the DNA, RNA or gene product is from a high risk or low risk 
strain of HPV. If it is determined that the DNA, RNA or gene product is derived from 
10 a high risk strain of HPV, the presence or absence of DNA, RNA or gene product 
from the LI or L2 region will provide the clinician with important information about 
the progression of the infection. If the LI or L2 regions are present then it is likely that 
the cell has not completed its transformation to the malignant state. If no LI or L2 
regions are present and only the E6, E7 or E6/E7 regions can be detected then the cell 
15 has been transformed to a malignant state and clinical intervention and further testing 
and treatment is warranted. 

If integration occurs in the precancerous stage and this occurs in the LI 
gene region, and if the E6 region is not integrated and remains detectable in the 
advancing stages of carcinogenesis, then combined L1/E6 PCR testing provides a 
20 diagnostic/prognostic indicator of progression. 

Tables 1-3 show clinical scenarios: 

1. LI negative and E6 negative = negative HPV 

2. LI positive and E6 positive = early disease state 

3. LI negative and E6 positive = advanced disease state 
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TABLE1 
INTEGRATION/DELETION SITES 



Study 


Integrated 
Deletion Sites 


Retention Sites 


Schwartz etal(1985) 


E2 to LI 


E6, E7 


Shirasawaetal(1987) 


E1.E2, E4, E5, L1.L2 


E6, E7 


Choo et al (1987) 


E2 


E6, E7 


Matsukura et al (1986) 


E1.E2.L2 


? 


Waggatsuma et al (1990) 


E1,E2, L1,L2 


E6, E7 


Jeonetal(1995) 


E2 


E6, E7 


Yoshinoiichi et al HOOH 


F.7 


Ffi 



TABLE 2 

FREQUENCY OF HPV16 IN CERVICAL CANCERS 
BY L1/E6 CONSENSUS PRIMERS COMPARED TO E6-E7 SPECIFIC PRIMERS 

15 



Reference 


Number of sample 


Ll/El 


E6/E7* j 


Van den Brule et al. (1990) 


21 




84% 


Ter Maulenet al. (1992) 


53 




38% 


Guerrero et al. (1992) 


302 




48% 


Prussia et al. (1993) 


20 




65% 


Eluf-Neto et al (1994) 


186 




54% 


Monk et al. (1994) 


218 




44% 


Williamson et al. (1994) 


68 




46% 


Karlsen et al. ( 1995) 


143 




63% 



25 
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TABLE 3 

VARIABILITY IN RATE OF DETECTION OF HPV IN CARCINOMA 
SAMPLES BY LI /El CONSENSUS PRIMERS IN DIFFERENT STUDIES 



— 




Target and primers J 


Reference 


Number of samples 


LI 










My 


Gp ' 


Cp 


Van den Brule et al. (1990) 


21 




91% | 


1- 


1 Ter Maulen et al. (1992) 


53 




89% I - 


Guerrero et al. (1992) | 


302 


69% 


- 1- 


| Prussia et al. (1993) 


20 




||90% 


| Eluf-Neto et al. (1994) 


186 




84% ||- 


iMonketal. (1994) 


218 


79% 


- 1- 


I Williamson etal. (1994) 


68 


81% 


- II- 


Karlsen et al. (1995) 


143 


91% 


9% 


| 89% 


Herrineton et al. (1995^ 


114 


54% 


- 


I- 



15 
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CL AIMS 

1 . A method for determining the progression of human papillomavirus 
infection which comprises detecting the presence or absence of a E6, E7 or E6/E7 
region of the human papillomavirus and the presence or absence of a LI, L2 or L1/L2 

5 region of the human papillomavirus wherein the absence of the E6, E7 or E6/E7 region 
and the absence of the LI, L2 or L1/L2 region signifies no human papillomavirus 
infection; the presence of the E6, E7 or E6/E7 region and the presence of the LI, L2 or 
L1/L2 region signifies an early stage human papillomavirus infection and the presence 
of the E6, E7 or E6/E7 region and the absence of the LI , L2 or L1/L2 region signifies a 
10 late stage human papillomavirus infection. 

2. The method according to claim 1, wherein polymerase chain reaction is 
used to detect the presence or absence of the E6, E7 or E6/E7 region and the presence 
or absence of the LI , L2 or L1/L2 region. 

3. The method according to claim 1, wherein polymerase chain reaction- 
15 enzyme linked immunosorbent assay is used to detect the presence or absence of the 

E6, E7 or E6/E7 region and the presence or absence of the LI, L2 or L1/L2 region. 

4. The method according to claim 1 , wherein ligase chain reaction is used 
to detect the presence or absence of the E6, E7 or E6/E7 region and the presence or 
absence of the LI, L2 or L1/L2 region. 

20 5. The method according to claim 1, wherein Q-beta replicase 

amplification is used to detect the presence or absence of the E6, E7 or E6/E7 region 
and the presence or absence of the LI, L2 or L1/L2 region. 

6. The method according to claim 1, wherein PCR and Taq man are used 
to detect the presence or absence of the E6, E7 or E6/E7 region and the presence or 

25 absence of the LI , L2 or L1/L2 region. 

7. The method according to claim 1, wherein nucleic acid sequence based 
amplification is used to detect the presence or absence of the E6, E7 or E6/E7 region 
and the presence or absence of the LI , L2 or L1/L2 region 

8. The method according to claim 1 , wherein the presence or absence of 
30 the E6, E7 or E6/E7 region and the presence or absence of the LI, L2 or L1/L2 region 

is determined by detecting the presence or absence of DNA from the E6, E7 or E6/E7 
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region and the presence or absence of DNA from the LI, L2 or L1/L2 region. 

9. The method according to claim 1, wherein the presence or absence of 
the E6, E7 or E6/E7 region and the presence or absence of the LI, L2 or L1/L2 region 
is determined by detecting the presence or absence of RNA coded by DNA from the 

5 E6, E7 or E6/E7 region and the presence or absence of RNA coded by DNA from the 
LI, L2 or L1/L2 region. 

10. The method according to claim 1, wherein the presence or absence of 
the E6, E7 or E6/E7 region and the presence or absence of the LI, L2 or L1/L2 region 
is determined by detecting expression of the E6, E7 or E6/E7 region and the presence 

10 or absence of expression of the LI, L2 or L1/L2 region. 

1 1 . The method according to claim 1, wherein the human papillomavirus is 
selected from the group consisting of la, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15, 
16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 
39, 40, 41, 42, 43, 44, 45, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56 , 57, 58, 59, 60, 61, 

15 62, 63, 64, 65, 66, 67, 68, 69, 70, 72, 73, 74, 75, 76, and 77. 

12. The method according to claim 1 , wherein the human papillomavirus is 
selected from the group consisting of human papillomavirus 6, 11, 16, 18, 31, 33, 35, 
39, 41, 42, 43, 44, 49, 50, 52, 54, 55, 56, and 68a. 

13* The method according to claim 1, wherein the human papillomavirus is 

20 selected from the group consisting of human papillomavirus 6, 11, 16, 18 and 33. 
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